Greedy Approach to Reliable Disease Susceptibility Prediction

نویسندگان

  • Dumitru Brinza
  • Irina Astrovskaya
  • Alexander Zelikovsky
چکیده

One of the main problems in genetic epidemiology is to robustly predict genetic susceptibility to complex diseases based on the data from case/control studies. This becomes computationally challenging in presence of interactions between multiple genes. In order to efficiently search through enormous amount of possible combinations, it is necessary to apply heuristics and the greedy approach has been successfully validated on many real data. In this paper we modify the genotype covering phase of the model-fitting susceptibility prediction algorithm from [1]. We improve reliability of the previously known prediction method based on greedy approach by replacing clean genotype coverage with dirty coverage, i.e., clusters participating in coverage do not overlap or overlap respectively. We have leave-one/many-out cross-validated existed and proposed prediction methods on real case/control studies of four diseases (Chron’s disease, autoimmune disorder, tick-born encephalitis, and lung cancer). Our results show that relaxation of the clean coverage significantly improves reliability of the greedy based susceptibility prediction approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Message from the Poster Chairs

One of the main problems in genetic epidemiology is to robustly predict genetic susceptibility to complex diseases based on the data from case/control studies. This becomes computationally challenging in presence of interactions between multiple genes. In order to efficiently search through enormous amount of possible combinations, it is necessary to apply heuristics and the greedy approach has...

متن کامل

Discrete Algorithms for Analysis Of

Accessibility of high-throughput genotyping technology makes possible genomewide association studies for common complex diseases. When dealing with common diseases, it is necessary to search and analyze multiple independent causes resulted from interactions of multiple genes scattered over the entire genome. The optimization formulations for searching disease-associated risk/resistant factors a...

متن کامل

Combinatorial Methods for Disease Association Search and Susceptibility Prediction

Accessibility of high-throughput genotyping technology makes possible genome-wide association studies for common complex diseases. When dealing with common diseases, it is necessary to search and analyze multiple independent causes resulted from interactions of multiple genes scattered over the entire genome. This becomes computationally challenging since interaction even of pairs gene variatio...

متن کامل

Hybrid Method of Logistic Regression and Data Envelopment Analysis for Event Prediction: A Case Study (Stroke Disease)

Abstract Predictive analytics is an area of statistics that deals with extracting information from data and using it to predict trends and behavior patterns. Many mathematical modeling has been developed and used for prediction, and in some cases, they have been found to be very strong and reliable. This paper studies different mathematical and statistical approaches for events prediction. The ...

متن کامل

An Improved Junction-Based Directional Routing Protocol (IJDRP) for VANETs

Vehicular Ad-Hoc Networks (VANETs) is a novel technology that has recently emerged and due to its swift changing topology and high mobility nature, it has become problematic to design an efficient routing protocol in VANETs’ amongst both moving and stationary units. Also, the existing routing algorithms are not very effective to satisfy all requirements of VANETs. This paper explores the need o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007